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Abstract 

We prove a "supersaturation-type" extension of both Sperner's Theorem (1928) and its gen- 
eralization by Erdos (1945) to fc-chains. Our result implies that a largest family whose size is x 
more than the size of a largest k- chain free family and that contains the minimum number of 
fc-chains is the family formed by taking the middle {k - 1) rows of the Boolean lattice and x ele- 
ments from the A;th middle row. We prove our result using the symmetric chain decomposition 
U: method of de Bruijn, van Ebbenhorst Tengbergen, and Kruyswijk (1951). 
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1 Introduction 

A core topic of extremal graph theory is the study of "Turan-type questions": fix a (finite) graph H 
and a positive integer n. What is the largest number ex(n, H) of edges in an n-vertex graph that 
contains no copy of HI More than a hundred years ago, Mantel HU answered this question in the 
case where H is K 3 , the triangle. About forty years later, Turan 1 19] generalized this to all complete 

di- 



graphs. More precisely, the Turdn graph T(n, r) is the complete r-partite graph of order n with 
parts of size [nl r\ or \nl r] . Not only did Turan prove that T{n, r) has the largest number of edges 
among all rc-vertex graphs with no copies of K r+ \, that is, ex(n, r) = \E{T{n, r))|, but also he proved 
that all other rc-vertex graphs containing no copies of K^+i have strictly fewer edges than T[n, r). 

The theory of graph supersaturation deals with the situation beyond the threshold given by 
ex(n,H). Specifically, define £{n, H, q) as the least number of copies of H in an n-vertex graph 
with at least ex(n, H) + q edges. By the definition, we know that £{n, H,q)^l as soon as q > 0, but 
it turns out that an extra edge is likely to create many more copies of H. Arguably, the first result in 
this direction was proved in an unpublished work of Rademacher from 1 94 1 (orally communicated 
to Erdos |3'|): while Mantel's theorem states that every n-vertex graph with more than \E{T(n,2))\ = 
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[n/2\ \n/2] edges contains a triangle, Rademacher established that such graphs contain, actually, 
at least [n/2\ triangle copies. 

This result was generalized by Erdos, who proved that £{n,K 3 ,q) ^ \_n/2\ first if q £ {1,2,3} 
in 1955 [3] and a few years later in the case q < c ■ n/2 for a fixed constant ce (0,1) @). More 
than twenty years later, Lovasz and Simonovits 1 13] established the following theorem, thereby 
confirming a conjecture of Erdos. 

Theorem 1.1 (Lovasz and Simonovits (1983)). Let n and q be positive integers. If q < n/2, then 
£{n,K 3 ,q)>q-[n/2\. 

In addition, Lovasz and Simonovits [13] determined £{n,K r , q) when q = o[n 2 ). Their techniques 
do not apply, though, for the case where q = D.{n 2 ). Solutions to this difficult problem were pro- 
vided recently with the aid of flag algebras: first, by Razborov 1 16] for H = K 3 , then, by Nikiforov H5] 
for H = K 4 and, finally, by Reiher 1 17] for the general case H = K r . 

Supersaturation results have not to our knowledge been studied as extensively in other im- 
portant areas of extremal combinatorics. In this paper, we pursue this direction for extremal set 
theory. 

Let the Boolean lattice SS n be the poset (2 [nl , Q) of all subsets of the set [n] = {I,..., n}, ordered 
by inclusion. For a set S, the collection of all fc-subsets of S is denoted by (^). Following notation 
of previous work 1 10] , by £$(n, k) and Z(n, k) we mean the families of subsets of [n] of the k middle 
sizes and the size of the families. More precisely, 
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or SB{n,k) = 
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(so, depending on the parity of n and k, this can be either one or two different families). 

Given two finite posets P = (P,^) and P' = CP', «;'), we say that P' contains P (or P is a (weak) 
subposet ofP'), if there is an injection f : P —> P' that preserves the partial ordering, i.e. if u ^ v in 
P, then f{u) f{v) in P'. We let ^ be the fc-element totally ordered poset (chain). 

What is the largest size La(n,P) of a family of subsets of [n] that does not contain PI The 
foundational result of this kind, Sperner's Theorem [TH from 1928, answers this question for a 
two-element chain: La(n,5 8 2) = Z(n, 1). Moreover, the value La(n,3 a 2) is attained only by SM{n, 1), 
which consists of subsets all of (a) middle size. Erdos |2 | almost two decades later generalized this 
to P = gPjc, showing that La(«,5 a / t) = k - 1), which is attained only by S8{n, k - 1). Katona has 
championed the La(n, P) problem for posets P other than A;-chains, and this is a challenging area 
of extremal set theory. It is often very difficult to obtain the extremal size La(n, P) of such a family, 
even asymptotically (see 1 10] for a survey). 

However, for those P for which we know the exact threshold, we can ask how many copies of 
P must be present in families larger than the threshold La(rc, P). Here we investigate the simplest 
instance of this problem, when P is a chain. Analogous to the way that Rademacher and Erdos 
(and subsequent researchers) have extended the theorems of Mantel and Turan, we present a su- 
persaturation extension of Sperner's Theorem and its fc-chain generalization by Erdos. 

Our initial result was a lower bound on the number of in a family F Q 2 [wl of a given size 
that is optimal for \F\ ^ Z(n,2), extending Sperner's Theorem. By investigating more examples, we 
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came to believe that for any size |F|, with £(«, £) ^ |F| ^ Z(n, £ + 1), the number of in F is min- 
imized by taking F to consist of SB{n, £) together with subsets of£8(n,£ + l).ln further exploration 
of problems related to poset-free families of subsets, we came across the work of Kleitman [TT] 
from 1968, which corroborates our findings and intuition. Indeed, Kleitman, albeit with match- 
ing theory techniques, obtained the (same) supersaturation extension of Sperner's Theorem and 
more. This settled a conjecture of Erdos and Katona. In particular, he determined the minimum 
number of pairs (A, B) with AcBina family F c 2 [wl of any given size. As we had intuited, taking 
the subsets of some middle sizes attains the optimum. 

One particularly nice way to quickly derive Sperner's Theorem and its generalization by Erdos 
is to employ the remarkable symmetric chain decomposition (SCD, for short) of all 2" subsets 
of [n], discovered by de Bruijn, van Ebbenhorst Tengbergen, and Kruyswijk Q] in 1951. It is a 
partition of the Boolean lattice into just ( Lw " 2 j) disjoint chains of subsets, where for each chain 
there is some k nil such that the chain consists of a subset of each size from k to n - k. For all k 
the decomposition induces the best possible upper bound on |F| for a ^-free family F of subsets 
of[n]. (It requires some additional arguments to obtain the extremal families.) The construction, 
which is obtained by a clever inductive argument, was done originally in the more general setting 
of a product of chains. In this way the authors obtained the extension of Sperner's Theorem to the 
lattice of divisors of an integer N. 

There is a large literature on the existence of SCDs in posets and other ordered /ranked set 
systems (6l[71[T2]. Greene and Kleitman |5| discovered an explicit SCD of the Boolean lattice for 
all n, based on a simple "bracketing procedure" , as opposed to the original inductive construction. 
Bracketing has proven to be valuable in its own right, such as for the Littlewood-Offord problem (6) 
and for the construction of symmetric Venn diagrams on n sets for all prime n |9) . 

It is not surprising then that a SCD of £& n yields a lower bound on the number of paths in 
a family F of given size. In particular, if we arbitrarily consider one particular SCD, the number 
of chains in F that are also chains in the SCD is minimized by taking the sets of F to be of the 
middle sizes. However, this argument does not account for the many containment relations for 
pairs of subsets Ac B where A and B are on different chains in the SCD. To adjust for this, and 
to exploit symmetry by avoiding bias towards a particular SCD, our new idea here is to take all n\ 
SCDs obtained by permutation of the ground set [n]. In this way we obtain lower bounds on the 
number of paths in a family F of given size, bounds that are best possible for small F. 

Our main aim in this paper is then to prove the following supersaturation extension of the 
theorems of Sperner and Erdos, using the above-outlined SCD approach. 

Theorem 1.2. If a family F of subsets of[n] satisfies \F\ = x + H{n, k - 1), then there must be at least 
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copies ofS^k in F. 
Note that 
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is the number of copies of g? k contained in 38{n, k), with one endpoint of the chain being a partic- 
ular set in the /cth middle row. Thus the family that consists of 38{n, k - 1) and x sets from the /cth 
middle row witnesses that the above bound is tight for 



n 



More generally, Kleitman HU has conjectured that for any k the natural construction (that se- 
lects subsets around the middle) minimizes the number of chains 3? k in F. Our result gives new 
information in support of this conjecture, verifying it for \F\ s; k). We suspect that a stronger 
version of our SCD method, in which weights are assigned, may lead to a proof of Kleitman's con- 
jecture in full for general k. So far our efforts in this direction that looked very promising have not 
yet succeeded. We cannot imagine that his conjecture is not correct. 



2 Proof of Theorem 1 .2 



As mentioned earlier, we shall use the symmetric chain decomposition of 38 n . 

Proof of Theorem Tna Given a poset (P, <) on 2 [n \ let us say that a A;-chain A\ e • •• c A k of F (in 
38 n ) is included in P if A\ < ■• ■ < A k , and furthermore define cf CP) to be the number of fc-chains 
of F included in P. For any SCD ^ of 38 n , let Ptg be the poset on 2 [wl defined by taking the disjoint 
union of the chains in c €. Let us fix the SCD c €. By the pigeonhole principle, P^ includes at least 
x fc-chains of F, i.e. Cf(P^) 5= x. Each (non-trivial) permutation n of [n] applied to ^€ results in a 
new unique SCD n^€) for 38 n . Note that n^) ^ n'^) for distinct permutations n and n' of [n] . By 
summing over the permutations n of [n] , we obtain 

n!-x^^c f (P w( ^)). 
ji 

Let us change the summation to sum over all A;-chains of F. For this, we define N{n, A\,..., A^) to 
be the number of permutations n such that Pn^) includes a given chain A\ c ■ • • c A^ of F. We 
obtain 

E c f^m)= E N{n,A l ,...,A k ). 

n A\c—cA k ;Ai€F 

Setting at := \Af \ for each i e {1, . , ., k}, it holds that 



N(n, Ai,...,A k ) = ai\-{a 2 - ai)\---{a k - a k -i)\ ■ (n- a k )\ ■ min 




where the last factor comes from the number of chains in a SCD that the given chain could fit. After 
some manipulation, we deduce that 
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We shall find a general upper bound for N{n, A\ , . . . , A0 by minimizing the maximum of y defined 
as [£) • • • Q) and z defined as (^J) • • • C""^ 1 ). Note the following binomial identity: 
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As a consequence of this, the values of y and z are invariant as long as the multiset of all differ- 
ences between consecutive values of ai is invariant. By this fact, if there is some difference in this 
multiset that is at least 2, we may assume without loss of generality that this "large" difference is 
between a k -i and a k . It follows that 



y -— ^ * i ai- \ ~ y ' i ai- \ y> 
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provided that a k -\ > 0. Similarly, 
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provided that «2 < n. It follows that y and z are minimized when the multiset of differences is the 
multiset of all ones, i.e. with 
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The maximum of y and z is then minimized by choosing a k to be 
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as required. 
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Remarks 

Our result was presented by the second author in Prague in June 2012 |8|. In the preparation of 
this manuscript, we learned that recently Das, Gan and Sudakov have independently pursued a 
similar line of research and possibly obtained results similar to ours. 
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